AITopics | computer science and engineering

Collaborating Authors

computer science and engineering

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

ChatGpt Content detection: A new approach using xlm-roberta alignment

Tanvir, Md Tasnin, Dash, Dr Santanu Kumar, Shahnan, Ishan, Fuad, Nafis, Rahman, Tanvir, Faisal, Abdullah Al, Mamun, Asadullah Al

arXiv.org Artificial IntelligenceNov-27-2025

The challenge of separating AI-generated text from human-authored content is becoming more urgent as generative AI technologies like ChatGPT become more widely available. In this work, we address this issue by looking at both the detection of content that has been entirely generated by AI and the identification of human text that has been reworded by AI. In our work, a comprehensive methodology to detect AI- generated text using XLM-RoBERTa, a state-of-the-art multilingual transformer model. Our approach includes rigorous preprocessing, and feature extraction involving perplexity, semantic, and readability features. We fine-tuned the XLM-RoBERTa model on a balanced dataset of human and AI-generated texts and evaluated its performance. The model demonstrated high accuracy and robust performance across various text genres. Additionally, we conducted feature analysis to understand the model's decision-making process, revealing that perplexity and attention-based features are critical in differentiating between human and AI-generated texts. Our findings offer a valuable tool for maintaining academic integrity and contribute to the broader field of AI ethics by promoting transparency and accountability in AI systems. Future research directions include exploring other advanced models and expanding the dataset to enhance the model's generalizability.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2511.21009

Country:

Asia > Bangladesh (0.15)
North America > United States > Texas (0.14)

Genre: Research Report > New Finding (0.34)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.54)

Add feedback

LLM-Driven Collaborative Model for Untangling Commits via Explicit and Implicit Dependency Reasoning

Hou, Bo, Tan, Xin, Zheng, Kai, Liu, Fang, Zhu, Yinghao, Zhang, Li

arXiv.org Artificial IntelligenceNov-6-2025

Atomic commits, which address a single development concern, are a best practice in software development. In practice, however, developers often produce tangled commits that mix unrelated changes, complicating code review and maintenance. Prior untangling approaches (rule-based, feature-based, or graph-based) have made progress but typically rely on shallow signals and struggle to distinguish explicit dependencies (e.g., control/data flow) from implicit ones (e.g., semantic or conceptual relationships). In this paper, we propose ColaUntangle, a new collaborative consultation framework for commit untangling that models both explicit and implicit dependencies among code changes. ColaUntangle integrates Large Language Model (LLM)-driven agents in a multi-agent architecture: one agent specializes in explicit dependencies, another in implicit ones, and a reviewer agent synthesizes their perspectives through iterative consultation. To capture structural and contextual information, we construct Explicit and Implicit Contexts, enabling agents to reason over code relationships with both symbolic and semantic depth. We evaluate ColaUntangle on two widely-used datasets (1,612 C# and 14k Java tangled commits). Experimental results show that ColaUntangle outperforms the best-performing baseline, achieving an improvement of 44% on the C# dataset and 82% on the Java dataset. These findings highlight the potential of LLM-based collaborative frameworks for advancing automated commit untangling tasks.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2507.16395

Country: Asia > China (0.29)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Add feedback

Colorectal Cancer Histopathological Grading using Multi-Scale Federated Learning

Arafath, Md Ahasanul, Ghosh, Abhijit Kumar, Ahmed, Md Rony, Afroz, Sabrin, Hosen, Minhazul, Moon, Md Hasan, Reza, Md Tanzim, Alam, Md Ashad

arXiv.org Machine LearningNov-6-2025

Colorectal cancer (CRC) grading is a critical prognostic factor but remains hampered by inter-observer variability and the privacy constraints of multi-institutional data sharing. While deep learning offers a path to automation, centralized training models conflict with data governance regulations and neglect the diagnostic importance of multi-scale analysis. In this work, we propose a scalable, privacy-preserving federated learning (FL) framework for CRC histopathological grading that integrates multi-scale feature learning within a distributed training paradigm. Our approach employs a dual-stream ResNetRS50 backbone to concurrently capture fine-grained nuclear detail and broader tissue-level context. This architecture is integrated into a robust FL system stabilized using FedProx to mitigate client drift across heterogeneous data distributions from multiple hospitals. Extensive evaluation on the CRC-HGD dataset demonstrates that our framework achieves an overall accuracy of 83.5%, outperforming a comparable centralized model (81.6%). Crucially, the system excels in identifying the most aggressive Grade III tumors with a high recall of 87.5%, a key clinical priority to prevent dangerous false negatives. Performance further improves with higher magnification, reaching 88.0% accuracy at 40x. These results validate that our federated multi-scale approach not only preserves patient privacy but also enhances model performance and generalization. The proposed modular pipeline, with built-in preprocessing, checkpointing, and error handling, establishes a foundational step toward deployable, privacy-aware clinical AI for digital pathology.

artificial intelligence, machine learning, md ashad alam, (9 more...)

arXiv.org Machine Learning

2511.03693

Country:

Asia (0.95)
North America > United States (0.29)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Oncology > Colorectal Cancer (0.62)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)

Add feedback

A Taxonomy of Prompt Defects in LLM Systems

Tian, Haoye, Wang, Chong, Yang, BoYang, Zhang, Lyuye, Liu, Yang

arXiv.org Artificial IntelligenceSep-19-2025

Large Language Models (LLMs) have become key components of modern software, with prompts acting as their de-facto programming interface. However, prompt design remains largely empirical and small mistakes can cascade into unreliable, insecure, or inefficient behavior. This paper presents the first systematic survey and taxonomy of prompt defects, recurring ways that prompts fail to elicit their intended behavior from LLMs. We organize defects along six dimensions: (1) Specification and Intent, (2) Input and Content, (3) Structure and Formatting, (4) Context and Memory, (5) Performance and Efficiency, and (6) Maintainability and Engineering. Each dimension is refined into fine-grained subtypes, illustrated with concrete examples and root cause analysis. Grounded in software engineering principles, we show how these defects surface in real development workflows and examine their downstream effects. For every subtype, we distill mitigation strategies that span emerging prompt engineering patterns, automated guardrails, testing harnesses, and evaluation frameworks. We then summarize these strategies in a master taxonomy that links defect, impact, and remedy. We conclude with open research challenges and a call for rigorous engineering-oriented methodologies to ensure that LLM-driven systems are dependable by design.

arxiv preprint arxiv, large language model, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2509.14404

Country: Asia > China (0.14)

Genre: Overview (1.00)

Industry: Information Technology > Security & Privacy (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Cognitive Guardrails for Open-World Decision Making in Autonomous Drone Swarms

Cleland-Huang, Jane, Granadeno, Pedro Antonio Alarcon, Bernal, Arturo Miguel Russell, Hernandez, Demetrius, Murphy, Michael, Petterson, Maureen, Scheirer, Walter

arXiv.org Artificial IntelligenceJun-3-2025

Small Uncrewed Aerial Systems (sUAS) are increasingly deployed as autonomous swarms in search-and-rescue and other disaster-response scenarios. In these settings, they use computer vision (CV) to detect objects of interest and autonomously adapt their missions. However, traditional CV systems often struggle to recognize unfamiliar objects in open-world environments or to infer their relevance for mission planning. To address this, we incorporate large language models (LLMs) to reason about detected objects and their implications. While LLMs can offer valuable insights, they are also prone to hallucinations and may produce incorrect, misleading, or unsafe recommendations. To ensure safe and sensible decision-making under uncertainty, high-level decisions must be governed by cognitive guardrails. This article presents the design, simulation, and real-world integration of these guardrails for sUAS swarms in search-and-rescue missions.

computer science and engineering, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2505.23576

Country: North America > United States > Indiana (0.15)

Genre: Research Report (0.50)

Industry:

Government > Regional Government > North America Government > United States Government (0.68)
Information Technology > Security & Privacy (0.68)
Government > Military (0.66)
Transportation > Air (0.46)

Add feedback

GBDTSVM: Combined Support Vector Machine and Gradient Boosting Decision Tree Framework for efficient snoRNA-disease association prediction

Muna, Ummay Maria, Hafiz, Fahim, Biswas, Shanta, Azim, Riasat

arXiv.org Artificial IntelligenceMay-13-2025

Small nucleolar RNAs (snoRNAs) are increasingly recognized for their critical role in the pathogenesis and characterization of various human diseases. Consequently, the precise identification of snoRNA-disease associations (SDAs) is essential for the progression of diseases and the advancement of treatment strategies. However, conventional biological experimental approaches are costly, time-consuming, and resource-intensive; therefore, machine learning-based computational methods offer a promising solution to mitigate these limitations. This paper proposes a model called 'GBDTSVM', representing a novel and efficient machine learning approach for predicting snoRNA-disease associations by leveraging a Gradient Boosting Decision Tree (GBDT) and Support Vector Machine (SVM). 'GBDTSVM' effectively extracts integrated snoRNA-disease feature representations utilizing GBDT and SVM is subsequently utilized to classify and identify potential associations. Furthermore, the method enhances the accuracy of these predictions by incorporating Gaussian kernel profile similarity for both snoRNAs and diseases. Experimental evaluation of the GBDTSVM model demonstrated superior performance compared to state-of-the-art methods in the field, achieving an area under the receiver operating characteristic (AUROC) of 0.96 and an area under the precision-recall curve (AUPRC) of 0.95 on MDRF dataset. Moreover, our model shows superior performance on two more datasets named LSGT and PsnoD. Additionally, a case study on the predicted snoRNA-disease associations verified the top 10 predicted snoRNAs across nine prevalent diseases, further validating the efficacy of the GBDTSVM approach. These results underscore the model's potential as a robust tool for advancing snoRNA-related disease research. Source codes and datasets our proposed framework can be obtained from: https://github.com/mariamuna04/gbdtsvm

artificial intelligence, machine learning, similarity, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.compbiomed.2025.110219

2505.06534

Country: Asia > Bangladesh > Dhaka Division > Dhaka District > Dhaka (0.40)

Genre:

Research Report > New Finding (1.00)
Research Report > Promising Solution (0.86)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

DiffPattern-Flex: Efficient Layout Pattern Generation via Discrete Diffusion

Wang, Zixiao, Zhao, Wenqian, Shen, Yunheng, Bai, Yang, Chen, Guojin, Farnia, Farzan, Yu, Bei

arXiv.org Artificial IntelligenceMay-9-2025

--Recent advancements in layout pattern generation have been dominated by deep generative models. However, relying solely on neural networks for legality guarantees raises concerns in many practical applications. In this paper, we present DiffPattern-Flex, a novel approach designed to generate reliable layout patterns efficiently. DiffPattern-Flex incorporates a new method for generating diverse topologies using a discrete diffusion model while maintaining a lossless and compute-efficient layout representation. T o ensure legal pattern generation, we employ an optimization-based, white-box pattern assessment process based on specific design rules. Furthermore, fast sampling and efficient legalization technologies are employed to accelerate the generation process. Experimental results across various benchmarks demonstrate that DiffPattern-Flex significantly outperforms existing methods and excels at producing reliable layout patterns. ELIABLE very-large-scale integration (VLSI) layout pattern libraries form the backbone of various Design for Manufacturability (DFM) research, such as refining design rules [1]-[3], optimizing Optical Proximity Correction (OPC) techniques [4]-[6], performing lithography simulations [7]-[9], and detecting layout hotspots [10]-[12]. With the increasing demand for layout patterns in machine-learning-based lithography design, building a comprehensive and practical large-scale pattern library has become highly resource-intensive due to the extended logic-to-chip design cycle. To address this challenge, a variety of rule-based and learning-based layout pattern generation methods have been introduced. These units were then randomly selected and combined. However, this approach results in limited diversity and quantity of generated patterns. More recently, learning-based generative methods [15]-[19] have demonstrated the ability to produce diverse layout patterns at a larger scale. This work is supported by The Research Grants Council of Hong Kong SAR (No. CUHK14208021) and the MIND project (MINDXZ202404). Y unheng Shen is with Tsinghua University, Beijing, China.

artificial intelligence, design rule, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2505.04173

Country:

Asia > China > Hong Kong (0.25)
Asia > China > Beijing > Beijing (0.24)

Genre: Research Report > New Finding (0.67)

Industry:

Semiconductors & Electronics (0.68)
Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

P2P-Insole: Human Pose Estimation Using Foot Pressure Distribution and Motion Sensors

Watanabe, Atsuya, Aisuwarya, Ratna, Jing, Lei

arXiv.org Artificial IntelligenceMay-5-2025

This work presents P2P-Insole, a low-cost approach for estimating and visualizing 3D human skeletal data using insole-type sensors integrated with IMUs. Each insole, fabricated with e-textile garment techniques, costs under USD 1, making it significantly cheaper than commercial alternatives and ideal for large-scale production. Our approach uses foot pressure distribution, acceleration, and rotation data to overcome limitations, providing a lightweight, minimally intrusive, and privacy-aware solution. The system employs a Transformer model for efficient temporal feature extraction, enriched by first and second derivatives in the input stream. Including multimodal information, such as accelerometers and rotational measurements, improves the accuracy of complex motion pattern recognition. These facts are demonstrated experimentally, while error metrics show the robustness of the approach in various posture estimation tasks. This work could be the foundation for a low-cost, practical application in rehabilitation, injury prevention, and health monitoring while enabling further development through sensor optimization and expanded datasets.

artificial intelligence, estimation, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2505.00755

Country: Asia > Indonesia > Sumatra > West Sumatra (0.14)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Consumer Health (0.69)
Information Technology > Smart Houses & Appliances (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Add feedback

Optimizing Helmet Detection with Hybrid YOLO Pipelines: A Detailed Analysis

M, Vaikunth, D, Dejey, C, Vishaal, S, Balamurali

arXiv.org Artificial IntelligenceDec-27-2024

Helmet detection is crucial for advancing protection levels in public road traffic dynamics. This problem statement translates to an object detection task. Therefore, this paper compares recent You Only Look Once (YOLO) models in the context of helmet detection in terms of reliability and computational load. Specifically, YOLOv8, YOLOv9, and the newly released YOLOv11 have been used. Besides, a modified architectural pipeline that remarkably improves the overall performance has been proposed in this manuscript. This hybridized YOLO model (h-YOLO) has been pitted against the independent models for analysis that proves h-YOLO is preferable for helmet detection over plain YOLO models. The models were tested using a range of standard object detection benchmarks such as recall, precision, and mAP (Mean Average Precision). In addition, training and testing times were recorded to provide the overall scope of the models in a real-time detection scenario.

detection, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.5121/csit.2024.142406

2412.19467

Country:

Asia > India (0.30)
Europe (0.28)

Genre:

Research Report (0.64)
Overview (0.46)
Personal (0.46)

Industry:

Information Technology (0.93)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.68)
Transportation > Ground > Road (0.48)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis (0.90)

Add feedback

Distributed AI in Zero-touch Provisioning for Edge Networks: Challenges and Research Directions

Hazra, Abhishek, Morichetta, Andrea, Murturi, Ilir, Lovén, Lauri, Dehury, Chinmaya Kumar, Pujol, Victor Casamayor, Donta, Praveen Kumar, Dustdar, Schahram

arXiv.org Artificial IntelligenceNov-29-2023

Zero-touch network is anticipated to inaugurate the generation of intelligent and highly flexible resource provisioning strategies where multiple service providers collaboratively offer computation and storage resources. This transformation presents substantial challenges to network administration and service providers regarding sustainability and scalability. This article combines Distributed Artificial Intelligence (DAI) with Zero-touch Provisioning (ZTP) for edge networks. This combination helps to manage network devices seamlessly and intelligently by minimizing human intervention. In addition, several advantages are also highlighted that come with incorporating Distributed AI into ZTP in the context of edge networks. Further, we draw potential research directions to foster novel studies in this field and overcome the current limitations.

computing continuum, edge network, ztp, (11 more...)

arXiv.org Artificial Intelligence

2311.17471

Country:

Europe > Austria > Vienna (0.14)
Europe > Finland > Northern Ostrobothnia > Oulu (0.05)
Europe > Estonia > Tartu County > Tartu (0.05)
(13 more...)

Genre: Research Report (0.64)

Industry:

Information Technology > Security & Privacy (1.00)
Telecommunications (0.93)

Technology:

Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Systems & Languages > Distributed Architectures (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback